Sequence Description Score E-value N

7tm_1 PF00001 7 transmembrane receptor (rhodopsin 119.9 4.7e-37 1
Parsed for domains:

Sequence Domain seq-f seq-t hmm-f hmm-t score E-value
7tm_1 1/1 57 321 .. 1 259 [] 119.9 4.7e-37
Alignments of top-scoring domains:

7tm_1: domain 1 of 1, from 57 to 321: score 119.9, E = 4.7e-37

*->GNlLVilvilrtkklrtptnifilNLAVADLLflltlppwalyylvg 
GN+ ++++++r +++r +t +++1NL ADLLf + p++ ++ -+ 
Flh14273, 57 GNVCALVLVAR-RRRRGATACLVLNLFCADLLFISAIPLVLAVR-WT 101

gaadWpf Gsa 1 Ck 1 vt a I dv vnny aS i 1 1 Lta IS i DRY 1 A 1 vhP 1 ryrrr 
e W++G++ C+1+ +++++++ + i 1+L+a : S++R + Iv 1+ +r 
Flh14273, 102 --EAWLLGPVACHLLFYVMTLSGSVTILTLAAVSLERMVCIV-HLQRGVR 148

r t spr r Ak v v i 1 1 v wv 1 a 1 1 1 s 1 Pp 1 1 f s wvkt veegngt 1 nvnve vC 1 i 
+r +V+++1+W +++++1P +f+ v+ ++ ++ ++ +C++ 
Flh14273, 149 GPGRRARAVLLALIWGYSAVAALPLCVFFRVVPQRLPG--ADQEISICTL 196

df pccstasvstw I r sy v 1 1 st 1 wgF 1 1 P 1 1 v i I vcYtr 1 1 rt 1 r 

+P++++ ++s+ +++ ++ Fl+P lvi++ Y+ II + + ++++ 
Flh14273, 197 IWPTIPGEISWDVSFVTLNFLVPGLVIVISYSKILQITKasrkr 240

: kaaktUvvvvvFvlCWIPyf ivllldtlc 

+ + +++++ + ++++ ++ +tl++++v F++ W P I++11 + 
Flh14273, 241 ltvslayseshqirvsqqdfRLFRTLFLLMVSFFIMWSPIIITILLILIQ 290

. lsi insstCelervlptallvtlwLayvNsclNPi 1Y<-* 
-+ + + p ++++ + ++++Na+lNPi+Y 

Flh14273, 291 nFKQDLVIWPSLFFWVVAFTFANSALNPILY 321



FIG. 2. 
Analysis of Flh14273, (362 aa)
[Plot not reproduced. Printed values: 5.1, 4.0, 4.0. Horizontal axis: residue position, ticks at 1, 41, 81, 121, 161, 201, 241, 281, 321, 361.]



>Flh14273, 1086 bases, 1825 checksum.

MSPECARAAGDAPLRSLEQANRTRFPFFSDVKGDHRLVLAAVETTVLVLIFAVSLLGNVC
ALVLVARRRRRGATACLVLNLFCADLLFISAIPLVLAVRWTEAWLLGPVACHLLFYVMTL
SGSVTILTLAAVSLERMVCIVHLQRGVRGPGRRARAVLLALIWGYSAVAALPLCVFFRVV
PQRLPGADQEISICTLIWPTIPGEISWDVSFVTLNFLVPGLVIVISYSKILQITKASRKR
LTVSLAYSESHQIRVSQQDFRLFRTLFLLMVSFFIMWSPIIITILLILIQNFKQDLVIWP
SLFFWVVAFTFANSALNPILYNMTLCRNEWKKIFCCFWFPEKGAILTDTSVKRNDLSIIS
G+ 



FIG. 4. 
Transmembrane Segments Predicted by MEMSAT
>Flh14273,

MSPECARAAGDAPLRSLEQANRTRFPFFSDVKGDHRLVLAAVETTVLVLIFAVSLLGNVC
ALVLVARRRRRGATACLVLNLFCADLLFISAIPLVLAVRWTEAWLLGPVACHLLFYVMTL
SGSVTILTLAAVSLERMVCIVHLQRGVRGPGRRARAVLLALIWGYSAVAALPLCVFFRVV
PQRLPGADQEISICTLIWPTIPGEISWDVSFVTLNFLVPGLVIVISYSKILQITKASRKR
LTVSLAYSESHQIRVSQQDFRLFRTLFLLMVSFFIMWSPIIITILLILIQNFKQDLVIWP
SLFFWVVAFTFANSALNPILYNMTLCRNEWKKIFCCFWFPEKGAILTDTSVKRNDLSIIS
G 

Transmembrane Segments for Presumed Mature Peptide 
>Flh14273, mature

LVLVARRRRRGATACLVLNLFCADLLFISAIPLVLAVRWTEAWLLGPVACHLLFYVMTLS
GSVTILTLAAVSLERMVCIVHLQRGVRGPGRRARAVLLALIWGYSAVAALPLCVFFRVVP
QRLPGADQEISICTLIWPTIPGEISWDVSFVTLNFLVPGLVIVISYSKILQITKASRKRL
TVSLAYSESHQIRVSQQDFRLFRTLFLLMVSFFIMWSPIIITILLILIQNFKQDLVIWPS
LFFWVVAFTFANSALNPILYNMTLCRNEWKKIFCCFWFPEKGAILTDTSVKRNDLSIISG

FIG. 6. 
Query: 14273m,
Scores for sequence family classification (score includes all domains):
Sequence Description Score E-value N

7tm_1 PF00001 7 transmembrane receptor (rhodopsin 118.8 1e-36 1

Parsed for domains:

Sequence Domain seq-f seq-t hmm-f hmm-t score E-value
7tm_1 1/1 57 321 .. 1 259 [] 118.8 1e-36
Alignments of top-scoring domains:

7tm_1: domain 1 of 1, from 57 to 321: score 118.8, E = 1e-36

*->GN 1 L V i 1 v i 1 rtkk 1 rtptn i f 1 1 NLAvADLLf 1 1 1 1 ppwa 1 yy 1 vg 
GN+ ++++++r +++r ++ +++1NL ADLLf+ + p++ ++ ++ 
14273m, 57 GNVCALVLVAR-RRRRGASASLVLNLFCADLLFTSAIPLVLVVR-WT 101

gaadWpf Gaa 1 Ck 1 vta 1 dvvnny aS i 1 1 1 Lta 1 S i DRY 1 A 1 vhP 1 ryrrr 
e W++G+++C+1+ +++++++ + il+L+a S++R + Iv lr + 
14273m, 102 --EAWLLGPVVCHLLFYVMTMSGSVTILTLAAVSLERMVCIV-RLRRGLS 148

rtsprr A, kvv i 1 1 vWv lalllslPpll f sw vkt veagngt 1 nvnvtvC 1 
rr+++++++++W ++1++1P ++++ v + ++q ++ +C+ 
14273m, 149 GP-GRRTqAALLAFIWGYSALAALPLYILFRVVPQRLPGGD--QEIPICT 195

idfpeestasvstwlrsyvllstlvgF21Pllvi lvoYtrllrtlr. . . . 
+d+p++ + ++s+ +++ ++ Fl+P lvi++ Y+ 11 + + +++ 
14273m, 196 LDWPNR I G — I SVDVFFETLNFL VPGLV I V I S YSK I LQI TKasrk 239 

■. kaaktllvvvvvFvlCWlPyfivllldt- 

+ + + +++++ + ++++ ++ +tl++++v F++ W P i++ll + 
14273m, 240 rltlslayseshqirvsqqdyRLFRTLFLLMVSFFIMWSPIIITILLILI 289

c. lai inestCelervlptallvtlwl_ayvNsclNPilY<-* 
++ + + p ++++ + ++++Ns+lNPi+Y 

14273m, 290 QnFRQDLVIWPSLFFWVVAFTFANSALNPILY 321

FIG. 8. 
Analysis of 14273m, (362 aa)
[Plot not reproduced. Track labels: PF00001, Cys, Ngly, out, TM, ins. Printed values: 5.1, 5.3, 3.8, 6.5, 3.2. Horizontal axis: residue position, ticks at 1, 41, 81, 121, 161, 201, 241, 281, 321, 361.]



>14273m, 1086 bases, 6943 checksum.

MSPECAQTTGPGPSHTLDQVNRTHFPFFSDVKGDHRLVLSVVETTVLGLIFVVSLLGNVC
ALVLVARRRRRGASASLVLNLFCADLLFTSAIPLVLVVRWTEAWLLGPVVCHLLFYVMTM
SGSVTILTLAAVSLERMVCIVRLRRGLSGPGRRTQAALLAFIWGYSALAALPLYILFRVV
PQRLPGGDQEIPICTLDWPNRIGEISWDVFFETLNFLVPGLVIVISYSKILQITKASRKR
LTLSLAYSESHQIRVSQQDYRLFRTLFLLMVSFFIMWSPIIITILLILIQNFRQDLVIWP
SLFFWVVAFTFANSALNPILYNMSLFRNEWRKIFCCFFFPEKGAIFTDTSVRRNDLSVIS



FIG. 10. 
Prosite Pattern Matches for 14273m
> PS00001/PDOC00001/ASN_GLYCOSYLATION N-glycosylation site.

Query: 21 NRTH 24 
Query: 322 NMSL 325

> PS00002/PDOC00002/GLYCOSAMINOGLYCAN Glycosaminoglycan attachment site.
RU Additional rules:

RU There must be at least two acidic amino acids (Glu or Asp) from -2 to
-4 relative to the serine.

Query: 148 SGPG 151 
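
The additional rule quoted for PS00002 can be checked mechanically against a reported hit. The following is a minimal sketch, not part of the original Prosite output. It assumes the fragment below is residues 142 to 153 of the 14273m listing with the scanner's stray spaces removed, and the function name `acidic_upstream` is hypothetical. It counts Glu/Asp residues at positions -4 to -2 relative to the serine.

```python
# Residues 142-153 of 14273m, copied from the sequence listing with OCR
# spacing removed (assumption: the listing is read correctly).
FRAGMENT = "RLRRGLSGPGRR"
FRAGMENT_START = 142   # 1-based position of the first residue in FRAGMENT
SERINE_POS = 148       # 1-based position of the serine in the SGPG hit
ACIDIC = set("DE")     # Asp and Glu, as named in the rule

def acidic_upstream(fragment, fragment_start, serine_pos):
    """Count acidic residues at positions -4..-2 relative to the serine."""
    idx = serine_pos - fragment_start          # 0-based index of the serine
    if fragment[idx] != "S":
        raise ValueError("serine_pos does not point at a serine")
    window = fragment[max(idx - 4, 0): idx - 1]  # residues at -4, -3, -2
    return sum(residue in ACIDIC for residue in window)

count = acidic_upstream(FRAGMENT, FRAGMENT_START, SERINE_POS)
print(f"acidic residues at -4..-2: {count}; rule satisfied: {count >= 2}")
```

Under these assumptions the window is R, R, G, so the pattern match at 148 to 151 is a pattern hit only and would need to be weighed against the additional rule.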

> PS00004/PDOC00004/CAMP_PHOSPHO_SITE cAMP- and cGMP-dependent protein kinase phosphorylation site.
Query: 239 KRLT 242 

> PS00005/PDOC00005/PKC_PHOSPHO_SITE Protein kinase C phosphorylation site.

Query: 237 SRK 239 
Query: 350 SVR 352 

> PS00006/PDOC00006/CK2_PHOSPHO_SITE Casein kinase II phosphorylation site.

Query: 40 SVVE 43 
Query: 256 SQQD 259 

> PS00008/PDOC00008/MYRISTYL N-myristoylation site.

Query: 57 GNVCAL 62 
Query: 72 GASASL 77 
Query: 343 GAIFTD 348 

> PS00009/PDOC00009/AMIDATION Amidation site.

Query: 150 PGRR 153
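
The pattern matches listed above can be reproduced with an ordinary regular-expression scan. The following is a minimal sketch, not the program that produced the listing. It assumes the PROSITE PS00001 consensus N-{P}-[ST]-{P}, uses only the first 60 residues of 14273m as printed in the MEMSAT listing, and the helper name `find_sites` is hypothetical.

```python
import re

# First 60 residues of 14273m, copied from the MEMSAT sequence listing.
N_TERMINUS = "MSPECAQTTGPGPSHTLDQVNRTHFPFFSDVKGDHRLVLSVVETTVLGLIFVVSLLGNVC"

# PS00001 consensus N-{P}-[ST]-{P} written as a regex. The lookahead
# wrapper lets overlapping candidate sites be reported as well.
N_GLYCOSYLATION = re.compile(r"(?=(N[^P][ST][^P]))")

def find_sites(sequence, pattern=N_GLYCOSYLATION):
    """Return (start, end, motif) tuples using 1-based residue positions."""
    hits = []
    for match in pattern.finditer(sequence):
        motif = match.group(1)
        start = match.start() + 1
        hits.append((start, start + len(motif) - 1, motif))
    return hits

for start, end, motif in find_sites(N_TERMINUS):
    print(f"Query: {start} {motif} {end}")
```

On this fragment the scan reports the NRTH site at residues 21 to 24, in the same "Query:" layout as the listing. Other patterns can be tried by passing a different compiled regex to `find_sites`.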
Transmembrane Segments Predicted by MEMSAT
>14273m,

MSPECAQTTGPGPSHTLDQVNRTHFPFFSDVKGDHRLVLSVVETTVLGLIFVVSLLGNVC 
ALVLVARRRRRGASASLVLNLFCADLLFTSAIPLVLVVRWTEAWLLGPVVCHLLFYVMTM
SGSVTILTLAAVSLERMVCIVRLRRGLSGPGRRTQAALLAFIWGYSALAALPLYILFRVV
PQRLPGGDQEIPICTLDWPNRIGEISWDVFFETLNFLVPGLVIVISYSKILQITKASRKR
LTLSLAYSESHQIRVSQQDYRLFRTLFLLMVSFFIMWSPIIITILLILIQNFRQDLVIWP
SLFFWVVAFTFANSALNPILYNMSLFRNEWRKIFCCFFFPEKGAIFTDTSVRRNDLSVIS
S 

Transmembrane Segments for Presumed Mature Peptide 
>14273m, mature

LVLVARRRRRGASASLVLNLFCADLLFTSAIPLVLVVRWTEAWLLGPVVCHLLFYVMTMS
GSVTILTLAAVSLERMVCIVRLRRGLSGPGRRTQAALLAFIWGYSALAALPLYILFRVVP
QRLPGGDQEIPICTLDWPNRIGEISWDVFFETLNFLVPGLVIVISYSKILQITKASRKRL
TLSLAYSESHQIRVSQQDYRLFRTLFLLMVSFFIMWSPIIITILLILIQNFRQDLVIWPS
LFFWVVAFTFANSALNPILYNMSLFRNEWRKIFCCFFFPEKGAIFTDTSVRRNDLSVISS

FIG. 12. 



